Discovery of large genomic inversions using pooled clone sequencing

نویسندگان

  • Marzieh Eslami Rasekh
  • Giorgia Chiatante
  • Mattia Miroballo
  • Mario Ventura
  • Chris T. Amemiya
  • Evan E. Eichler
  • Francesca Antonacci
  • Can Alkan
چکیده

Motivation: There are many different forms of genomic structural variation that can be broadly classified into two groups as copy number variation (CNV) and balanced rearrangements. Although many algorithms are now available in the literature that aim to characterize CNVs, discovery of balanced rearrangements (inversions and translocations) remains an open problem. This is mainly because the breakpoints of such events typically lie within segmental duplications and common repeats, which reduce the mappability of short reads. The 1000 Genomes Project spearheaded the development of several methods to identify inversions, however, they are limited to relatively short inversions, and there are currently no available algorithms to discover large inversions using high throughput sequencing technologies (HTS). Results: Here we propose to use a sequencing method (Kitzman et al., 2011) originally developed to improve haplotype phasing to characterize large genomic inversions. This method, called pooled clone sequencing, merges the advantages of clone based sequencing approaches with the speed and cost efficiency of HTS technologies. Using data generated with pooled clone sequencing method, we developed a novel algorithm, dipSeq, to discover large inversions (>500 Kbp). We show the power of dipSeq first on simulated data, and then apply it to the genome of a HapMap individual (NA12878). We were able to accurately discover all previously known and experimentally validated large inversions in the same genome. We also identified a novel inversion, and confirmed using fluorescent in situ hybridization. Availability: Implementation of the dipSeq algorithm is available at https://github.com/BilkentCompGen/dipseq Contact: [email protected], [email protected]

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Genotyping of Intron 22 and Intron 1 Inversions of Factor VIII Gene Using an Inverse-Shifting PCR Method in an Iranian Family with Severe Haemophilia A

Abstract Background: Haemophilia A (HA) is an X-linked bleeding disorder caused by the absence or reduced activity of coagulation factor VIII (FVIII). Coagulation factors are a group of related proteins that are essential for the formation of blood clots. The aim of this study was to genotype the coagulation factor VIII gene mutations using Inverse Shifting PCR (IS-PCR) in an Iranian family ...

متن کامل

Dinoflagellate Genomic Organization and Phylogenetic Marker Discovery Utilizing Deep Sequencing Data

Title of Dissertation: DINOFLAGELLATE GENOMIC ORGANIZATION AND PHYLOGENETIC MARKER DISCOVERY UTILIZING DEEP SEQUENCING DATA Gregory Scott Mendez, Doctor of Philosophy, 2016 Dissertation directed by: Professor Charles F. Delwiche, Cell Biology and Molecular Genetics Dinoflagellates possess large genomes in which most genes are present in many copies. This has made studies of their genomic organi...

متن کامل

A clone-array pooled shotgun strategy for sequencing large genomes.

A simplified strategy for sequencing large genomes is proposed. Clone-Array Pooled Shotgun Sequencing (CAPSS) is based on pooling rows and columns of arrayed genomic clones, for shotgun library construction. Random sequences are accumulated, and the data are processed by sequential comparison of rows and columns to assemble the sequence of clones at points of intersection. Compared with either ...

متن کامل

Clone-array pooled shotgun mapping and sequencing: design and analysis of experiments.

This paper studies sequencing and mapping methods that rely solely on pooling and shotgun sequencing of clones. First, we scrutinize and improve the recently proposed Clone-Array Pooled Shotgun Sequencing (CAPSS) method, which delivers a BAC-linked assembly of a whole genome sequence. Secondly, we introduce a novel physical mapping method, called Clone-Array Pooled Shotgun Mapping (CAPS-MAP), w...

متن کامل

Evaluation of a Pooled Strategy for High-Throughput Sequencing of Cosmid Clones from Metagenomic Libraries

High-throughput sequencing methods have been instrumental in the growing field of metagenomics, with technological improvements enabling greater throughput at decreased costs. Nonetheless, the economy of high-throughput sequencing cannot be fully leveraged in the subdiscipline of functional metagenomics. In this area of research, environmental DNA is typically cloned to generate large-insert li...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015